INTERACTIVE VIDEO DATA GENERATING SYSTEM AND 

METHOD THEREOF 

BACKGROUND OF THE INVENTION 

Field of Invention 

5 The present invention relates to a video generating system and method, more particularly 

a system and method that can establish link relations between different video contents by 
means of block locations, to obtain interactive video broadcasting. 

Related Art 

Present video media include diverse types such as television, cinema, optical video discs, 
10 etc. Regardless of whether these video media use video information recorded beforehand or 
recorded in real-time, their broadcasting usually is time- sequential. In other words, the user 
can receive the broadcast video file only according to a unidirectional and fixed sequence. In 
addition, the user usually is allowed to perform only simple manipulations on the broadcast 
video sequence, such as fast forward move, reverse move, broadcast, pause, stop, etc. 
15 Presently, the user cannot effectuate any adjustment according to actual content broadcast 
demands. The present state of the art therefore encounters diverse issues such as single video 
broadcast contents, unidirectional broadcast sequence, and low interactivity with the user. 

One cause of the foregoing problems resides in the time-sequence nature of traditional 
video broadcasting. To improve these issues, an interesting approach is to base the 

20 generation of video data on its content, and establish link relations between video files. With 
video files formed with link relations, different types of video switching/broadcasting 
functionality can be performed according to real-time manipulations of the user, who can 
watch different video files. With this method, the video files can be implemented in a wide 
range of application such as video games, advertisement, multimedia, etc. Important 

25 improvements therefore are needed in the video industry, which researches and developments 
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will likely focus on a video generating system and method that can provide more interactivity 
and flexibility in application. 

SUMMARY OF THE INVENTION 

It is therefore an objective of the invention to provide an interactive video generating 
5 system and method that can provide high interactivity with the content of the broadcast video 
file, and thereby overcome the prior problems of unidirectional fixed broadcast sequence. 

According to an embodiment, the interactive video generating system comprises a file 
document database 110, a link display module 120, a selection input module 130, a block 
defining module 140, and a relation generating module 150. 

10 According to another embodiment, an interactive video generating method comprises: (1) 

analyzing position data of a display page frame of a video data selection; (2) performing a 
block locating process in the display page frame; (3) creating a link record of the display page 
frame and saving it; (4) performing a tracking and defining process in a next page frame; and 
(5) generating a relation data document corresponding to the video data selection. 

15 In the system and method of the invention, interactive link relations can be created 

between video files via block locations in the page fames of the video files. Via a user 
selection and/or input manipulation, the block locating process is automatically performed to 
create corresponding link records and corresponding relation data documents. The video 
content therefore can be subjected to reference via the link records in the relation data 

20 documents so that an interactive video broadcast with instantaneous broadcast according to 
desired selections can be obtained. 

By forming interactive video files, the user is more inclined to actively participate in the 
video broadcasting process, which makes it more attractive and flexible for a wide range of 
applications: 

25 Further scope of applicability of the present invention will become apparent from the 
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detailed description given hereinafter. However, it should be understood that the detailed 
description and specific examples, while indicating preferred embodiments of the invention, 
are given by way of illustration only, since various changes and modifications within the 
spirit and scope of the invention will become apparent to those skilled in the art from this 
5 detailed description. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention will become more fully understood from the detailed description 
given in the illustration below only, and is thus not limitative of the present invention, and 
wherein: 

10 FIG. 1 is a schematic diagram of an interactive video data generating system and method 

according to an embodiment of the invention; 

FIG. 2a is a flowchart of an interactive video data generating method according to an 
embodiment of the invention; 

FIG. 2b is flowchart of a block locating process of a display page frame implemented in 
15 an interactive video data generating method according to an embodiment of the invention; 

FIG. 2c is a flowchart of a tracking and defining process of a video data selection 
implemented in an interactive video data generating method according to an embodiment of 
the invention; 

FIG. 3 is a flowchart of a tracking and defining process implemented to create relation 
20 data documents in an interactive video data generating method according to an embodiment 
of the invention; and 

FIG. 4 is a schematic view of an interactive video-broadcasting interface implemented in 
an interactive video data generating system and method according to an embodiment of the 
invention. 
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DETAILED DESCRIPTION OF THE INVENTION 



The invention provides an interactive video data generating system and method. 
Referring to FIG. 1, an interactive video data generating system comprises the following 
elements. 

5 A file document database 110 stores video files for link relation and relation data 

documents 820 corresponding to the video files. 

The video files can be generated with diverse encoding formats. Link relations can be 
established for different encoding types or formats of video files. The relation data 
documents 820 include a plurality of link records 830. When a video file is being 
1 0 broadcasted, the link records are used for reference according to the user's selection, and then 
are used for selectively broadcasting other corresponding video files or sections. Generally, 
the link records 830 of the relation data documents 820 at least include the following 
elements (FIG. 3 illustrates an example of relation data documents 820): 

(a) page frame data, recording the page frame having a link relation, wherein the page 
1 5 frame data can support a single page frame number (for example, "n" will indicate the n-th 

page frame), or a page frame number range (for example, [n, n+2] refers to all the page 
frames between the page frame number n and the page frame number (n+2); 

(b) block data, recording the block location set as a link relation. Its name can be 
desirably defined by the user; generally, each page frame can have more than one block 

20 location settings at the same time; and 

(c) link data, respectively corresponding to the block data and used for recording video 
files or sections referred to by the link relations of the block locations. 

A link display module 1 20 provides a user interface operable to display tables of video 
files. In addition, the link display module 120 receives event-triggering signals to prompt 
25 selected video items and broadcast display page frames with the corresponding relation data 
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documents 820. 



The user interface provides diverse manipulating options such as selecting the video 
file(s), to be subjected to a link relation. The user interface can display in hierarchical menu 
the video files of the file document database 1 10 or the link records 830 of the relation data 
5 documents 820 for user's selection and manipulation. The whole user interface performs 
displaying according to a "What You See Is What You Get" (WYSIWYG) mode, so that the 
user can control any change and modification made in video file tables or link records 830 of 
the relation data document 820. When the user wants to set a video link relation, a selection 
of video data is to be decided first. The user interface then provides the user with the display 
10 page frame of the video data selected by the user as reference for a link relation. 

A selection input module 130 generates an event-triggering signal according to the user's 
selection/input manipulation, by means of which selection and input of video data and a 
display page frame can be entered in the system. 

The event- triggering signal is created by a user's manipulation. Usually, this 
15 event-triggering signal is created via triggering a sensitive display device, such as a touch 
panel display screen or a pointing positioning device, such as a computer mouse. 

A block defining module 140 proceeds to block locating according to the user's 
selection/input of the display page frames of the video data selection, and generates link 
records 830, stored in the relation data document 820 corresponding to the video data 
20 selection. 

The block locating process performed by the block-defining module 140 comprises the 
following parts: 

(a) an optical flow analysis, where an initial block boundary corresponding to the user's 
selection is evaluated according to similar optical properties; 

25 (b) a feature extraction, where contents of the initial block boundary without similar 
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features are filtered out; and 



(c) a clustering treatment, where adaptive bounding techniques are applied to the 
remaining pixels with similar features to mark up and generate an exact block location. 

According to the block location data generated by the block defining module 140, a 
5 relation generating module 150 then tracks and defines similar block locations in the 
following page frames of the video data selection. The definition results then are added to the 
relation data document 820 corresponding to the video data selection. 

Generally, it may happen that a same block location of a video data selection appears in 
different page frames. To avoid repeating the same block locating manipulation, the relation 

10 generating module 150 according to variations such as 'in direction of movement', locations, 
etc., implements spatio-temporal matching techniques to infer the positions in other page 
frames where the same block location may appear. The relation- generating module 150 then 
defines the block location found in the following page frame according to the link record 830 
of the block location of the previous page frame. All the same block locations in the entire 

15 video data selection thereby are set with the same link relation. 

FIG. 2a is a flowchart of an interactive video data generating method according to an 
embodiment of the invention. 

First, location data of the display page frame corresponding to a video data selection are 
analyzed (step 200). The location data are coordinate data, which are generated when the 

20 user triggers a sensitive display device or a pointing positioning device. According to the 
coordinate data, a block locating process is performed in the display page frames (step 300). 
This block locating process is detailed in FIG. 2b. After the block location has been 
determined, the user selects or inputs the video files or sections to be subjected to a link 
relation. The link records 830 of the display page frame, once being generated, are stored in 

25 the relation data document 820 (step 400). Subsequently, a tracking and defining process is 
performed on the next page frames of the video data selection (step 500), which is detailed in 
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FIG. 2c. After the link definition has been achieved for the block locations found in the 
following page frames, the finally created relation data document 820 is saved up (step 600), 
which completes the interactive video data generating flow. 

FIG. 2b is a flowchart of the block locating process according to an embodiment of the 
5 invention. First, optical flow properties of a block location are determined according the 
position data of the display page frame (step 310), i.e. the optical flow properties at the 
location selected by the user are determined. According to the optical flow properties, an 
initial block boundary is created (step 320), which is performed by using an optical flow 
analysis. A feature extraction then is applied on the initial block boundary (step 330), to 
10 eliminate contents without similar features. Lastly, a clustering treatment is performed (step 
340), using adaptive bounding techniques to mark up remaining pixels with the same features, 
and thereby generate the exact block location (step 350). 

It may happen that a same block location of a video data selection appears in different 
page frames. To avoid repeating the same block locating manipulation, a tracking and 

15 defining process (step 500) is performed on the following page frames of the video data 
selection to define a link record 830 for all the similar block locations in the same video data 
file. FIG. 2c details this tracking and defining process. First, block location information is 
read (step 510). This block location information includes block boundary data and link 
record 830 data. Spatio-temporal techniques then are implemented to track the same block 

20 location in the next page frame (step 520). According to the variations such as 'in movement 
directions', speed, locations, etc., the spatio-temporal techniques infer the position in the next 
page frame where the block location is likely to appear. Then it is determined whether the 
same block location is actually found therein (step 530). If no similar block location is found, 
tracking step 520 continues, otherwise the block location data are resolved (step 540) to 

25 determine the position data of the block location. The newly found block location is defined 
according to the link record 830 previously set by the user (step 550). The above steps 520 to 
550 are repeated until the tracking and a defining process is achieved for the entire video data 
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selection. 

FIG. 3 is a schematic diagram illustrating the tracking and defining process implemented 
to generate the relation data document 830 according to an embodiment of the invention. In a 
display page frame (n) 801, two block locations 811 (HR1) and 812 (HR2) have been 
5 determined. Link record 830 of a corresponding relation data document 820 includes: a link 
file SI of the block location 81 1 (HR1), and a link file Fl of the block location 812 (HR2). If 
the same block locations 811,812 are tracked in a next page frame 802, the definition of the 
two block locations 811, 812 is automatically added to the corresponding relation data 
document 820. The page frame information in the link record 830, originally being n, will be 

10 consequently modified to [n, n+1] to indicate that the link record is suitable for the display 
page frame (n) 801 and the display page frame (n+1) 802. If the same block locations (HR1) 
81 1 and (HR2) 812 are found again in the following page frame (n+2) 803, the definition of 
the block locations similarly is added to the corresponding relation data document 820, as 
illustrated. The page frame information in the link record 830 will be consequently modified 

15 to [n, n+2] to indicate that the link record is suitable for the display page frame (n) 801 
through the display page frame (n+2) 803. Via this recurrent method, the user performs only 
one manipulation and all the same block locations of the video data selection are uniformly 
defined for broadcasting. 

Video broadcasting can be performed via an interactive video-broadcasting interface. As 
20 illustrated in FIG. 4, an interactive video-broadcasting interface according to an embodiment 
of the invention includes a video data relation display area 901, a video-broadcasting area 902, 
and a user's manipulating area 903. The video data relation display area 901 can use a 
multiplicity of levels to show video data relations, so that the user immediately can visually 
appreciate the entire link relation structure between the video data. The video-broadcasting 
25 area 902 broadcasts the video file selected by the user, and the user's manipulating area 903 
receives the user's selection/input as well as other diverse manipulation items for the video 
file. 



It will be apparent to the person skilled in the art that the invention as described above 
may be varied in many ways, and notwithstanding remaining within the spirit and scope of 
the invention as defined in the following claims. 
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